Towards Multi-modal Hearing Aid Design and Evaluation in Realistic Audio-Visual Settings: Challenges and Opportunities
نویسندگان
چکیده
A limited number of research developments in the field of speech enhancement have been implemented into commercially available hearing-aids. However, even sophisticated aids remain ineffective in environments where there is overwhelming noise present. Human performance in such situations is known to be dependent upon input from both the aural and visual senses that are then combined by sophisticated multi-level integration strategies. In this paper, we consider the opportunities and challenges presented by hearing-aid development in an audio-visual (AV) speech context. First, we posit the case for new multimodal AV algorithms that enhance speech quality and intelligibility with the aid of video input and low-latency combination of audio and visual speech information. Second, we consider the challenges that the AV setting presents to hearing aid evaluation. We argue that to meaningfully reflect everyday usage, hearing aid evaluation needs to be performed in an audio-visual setting regardless of whether hearing aids are directly using visual information themselves. We consider the need for new AV speech in noise listening tests, and for research into techniques for predicting objective AV speech quality and intelligibility. Finally, an AV speech enhancement evaluation challenge is proposed as a starting point for stakeholder discussion.
منابع مشابه
Causes of Spelling Weaknesses in Students with Visual Impairment and Teaching Strategies for Spelling Improvement
Background: Spelling is characterized as a basic skill for the children’s writing literacy. A wide range of factors may contribute to the formation and/ or intensification of problems related to teaching writing competencies, so that spellings come with a serious challenge called “invented spelling”. It is also a major concern for teachers and parents of children with visual impairments. Theref...
متن کاملTeam-Based Integrated Knowledge Translation for Enhancing Quality of Life in Long-term Care Settings: A Multi-method, Multi-sectoral Research Design
Multi-sectoral, interdisciplinary health research is increasingly recognizing integrated knowledge translation (iKT) as essential. It is characterized by diverse research partnerships, and iterative knowledge engagement, translation processes and democratized knowledge production. This paper reviews the methodological complexity and decision-making of a large iKT projec...
متن کاملThe teleface project multi-modal speech-communication for the hearing impaired
The Teleface Project, a project that aims at evaluating the possibilities for a telephone communication aid for hard of hearing persons, is presented as well as the different parts of the project: audio-visual speech synthesis, visual speech measurement and multimodal speech intelligibility studies. The experiments showed a noticeable intelligibility advantage for the addition of the face infor...
متن کاملInvestigating Sound Intensity Gradients as Feedback for Embodied Learning
This paper explores an intensity-based approach to sound feedback in systems for embodied learning. We describe a theoretical framework, design guidelines, and the implementation of and results from an informant workshop. The specific context of embodied activity is considered in light of the challenges of designing meaningful sound feedback, and a design approach is shown to be a generative wa...
متن کاملAn Evaluation of Multi-Modal User Interface Elements for Tablet-Based Robot Teleoperation
For robot teleoperation systems, tablet and smart phone user interfaces provide portability and accessibility to allow anyone anywhere to connect to the system quickly and easily. This can be highly advantageous for disaster-relief robot systems where timing is critical. However, the small screen size and unconventional input methods mean that traditional teleoperation user interface elements, ...
متن کامل